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Abstract. We present an exposition of the theory of M -automata and 
G-automata, or finite automata augmented with a multiply-only register 
storing an element of a given monoid or group. Included are a number of 
new results of a foundational nature. We illustrate our techniques with 
a group-theoretic interpretation and proof of a key theorem of Chomsky 
and Schiitzenberger from formal language theory. 

1. Introduction 

In recent years, both computer scientists and pure mathematicians have 
become increasingly interested in the class of M -automata, or finite state 
automata augmented with a memory register which stores at any given 
time an element of a given monoid M. The register is initialised with the 
identity element of the monoid; while reading an input word the automaton 
can modify the register contents by multiplying by elements of the monoid. 
A word is accepted by the automaton if, having read the entire word, the 
automaton reaches a final state, with the register returned to the identity 
element. 

Such automata have arisen repeatedly, both explicitly and implicitly, in 
the theory of computation. For example, the blind n-counter machines stud- 
ied by Greibach [TU] are simply Z"-automata. Related examples have also 
been studied by Ibarra, Sahni and Kim [T3]. M-automata are also equiva- 
lent to a class of regulated grammars, known as regular valence grammars 
[8]. More recently, there has been increasing interest in this idea from pure 
mathematicians, especially in the case that the register monoid is a group. 
An area of lasting interest in combinatorial group theory is the connection 
between structural properties of infinite discrete groups, and language the- 
oretic properties of their word problems. Results of Gilman and Shapiro [9], 
of Elston and Ostheimer [7] and of the author p3] have demonstrated that 
M-automata can play a useful role in this area. Moreover, recent research 
of the author, Silva and Steinberg [15] has established a connection between 
the theory of G-automata and the rational subset problem for groups, allow- 
ing language-theoretic results to be applied to decision problems in group 
theory. 

One aim of the present paper is to provide a self-contained introduction 
to the theory of this important area, in a form intelligible both to pure 
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mathematicians and to computer scientists. In doing so, we aim to make 
explicit and transparent the connections between the computational and the 
algebraic approaches to M-automata and G-automata. This is especially 
important since, to date, group theorists, semigroup theorists and computer 
scientists have all worked on this topic, often unaware of crucial results 
from the other discipline. Another objective is to establish a number of new 
results of a foundational nature. 

In addition to this introduction, this paper comprises three sections. Sec- 
tion [2] briefly recalls some necessary preliminaries, and gives suggested ref- 
erences to more detailed treatments. Section [3] introduces M-automata and 
the families of languages they define. We also explain their relationship to 
the theory of rational transductions, and study the connection between al- 
gebraic properties of monoids and closure properties of the language classes 
they define. In Section 0J we proceed to show how M-automata techniques 
can be used to obtain a group-theoretic interpretation and proof of one of the 
most important results in formal language theory - namely, the Chomsky- 
Schiitzenberger theorem characterising context-free languages as the images 
under rational transductions of 2-sided Dyck languages [2]. 



2. Preliminaries 

In this section, we introduce the basic definitions which will be required 
in this paper. We begin with a very brief introduction to formal languages 
and automata; a more comprehensive exposition can be found in any of the 
numerous texts on the subject, such as [TT]. We assume a familiarity with 
some basic definitions from algebra, such as semigroups and monoids |12j . 
groups, generating sets and presentations [EJUS]. More specialist notions 
from algebra will be defined as and when they are needed. 

Let E be a finite set of symbols, called an alphabet. A word over E is 
a finite sequence of zero or more symbols from E; the unique empty word 
of length zero is denoted e. The set of all words over E forms a monoid 
under the operation of concatenation; this is called the free monoid on E 
and denoted E*. A language over E is a set of words over S, that is, a subset 
of the free monoid E*. 

A finite automaton over a monoid M is a finite directed graph, possibly 
with loops and multiple edges, with each edge labelled by element of M, 
together with a designated initial vertex and a set of designated terminal 
vertices. The vertices and edges of an automaton are often called states 
and transitions respectively. The labelling of edges extends naturally to 
a labelling of (directed) paths by elements of M. The subset accepted or 
recognised by the automaton is the set of all elements of M which label paths 
between the initial vertex and some terminal vertex. A subset recognised 
by some automaton is called a rational subset of M. Notice that finitely 
generated submonoids of M are examples of rational subsets. 

Of particular interest is the case where M = E* is a free monoid on an 
alphabet E, so that the automaton accepts a set of words over E, that is, 
a language over E. A language accepted by such an automaton is called 
a rational language or a regular language. The formal study of languages 
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in general, and of regular languages in particular, is of fundamental impor- 
tance in theoretical computer science, and increasingly also in combinatorial 
algebra. 

Another interesting case is that where M = X* x Q* is a direct product of 
free monoids. Such an automaton is called a finite transducer from X* to Q*; 
it recognises a relation, termed a rational transduction. Relations between 
free monoids, and rational transductions in particular, are a powerful tool 
for studying relationships between languages. If R C X* x $7* then we say 
that the image of a language L C X* under R is the language of all words 
v G O* such that (u, v) E R for some u E L. We say that a language if is 
a rational transduction of a language L if is the image of L under some 
rational transduction. For a detailed exposition of the theory of rational 
transductions, see pp. 



3. M-AUTOMATA AND G- AUTOMATA 

In this section, we introduce the definitions and some basic properties of 
M-automata and G-automata. Let M be a monoid with identity 1 and X 
a finite alphabet. An M -automaton over X is a finite automaton over the 
direct product monoid M x X*. For simplicity, we assume that the edges 
are labelled by elements of M x (X U {e}). We identify the free monoid X* 
with its natural embedding into M x X* as {1} x X*; thus, a word m£S* 
is accepted by the automaton if there is a path from the initial vertex to 
a terminal vertex labelled (l,w) € M x X*. The language accepted by the 
automaton is the set of all words in X* accepted by the automaton; it is the 
intersection of the subset accepted with the embedded copy {1} x X* of X*. 
We denote by F(M) the family of all languages accepted by M-automata. 

From a mathematical perspective, then, the theory of M-automata can 
be viewed as an attempt to understand certain properties of a space (the free 
monoid X*) by embedding it into a larger space (M x X*) with more struc- 
ture; in this sense, it is very roughly analoguous to the embedding of the 
real numbers into the complex numbers. From a computational perspective, 
an M-automaton can be thought of as a (non-deterministic) finite automa- 
ton augmented with an extra memory register, which stores at any point 
an element of the monoid M. The register is initialised with the identity 
element of the monoid, and at each stage in its operation, the automaton 
can modify the contents of the register by multiplication on the right by 
some element of the monoid M. Of course it can also leave the register 
unchanged, simply by multiplying by the identity. The automaton cannot 
read the register during operation, but the contents act as an extra barrier 
to acceptance — a word is accepted only if reading it can result in reaching 
a final state in which the register value has returned to the identity. 

As an example, recall that a blind n-counter automaton is a finite au- 
tomaton augmented with n registers, each of which stores a single integer 
value [10]. The registers can be incremented and decremented but not read; 
they are initialised to zero, and a word is accepted exactly if, when reading 
it, the automaton can reach a final state with all registers returned to zero. 
In view of the discussion above, it is clear that a blind n-counter automaton 
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is essentially the same thing as a Z™-automaton, where Z denotes the group 
of integers under addition, that is, the infinite cyclic group. 

Notice that we have not required that the register monoid be finitely 
generated. However, the following elementary observation will often allow 
us to restrict attention to the case in which it is. 

Proposition 1. Let M be a monoid, and suppose that L is accepted by an 
M -automaton. Then there exists a finitely generated submonoid N of M 
such that L is accepted by a N -automaton. 

Proof. Since an M-automaton has finitely many edges, only finitely many 
elements of M can feature as the left-hand component of edge labels in 
a given automaton. Clearly, the register can only ever hold values in the 
submonoid N of M generated by these elements, so it suffices to view the 
automaton as an ^-automaton. □ 

Notwithstanding Proposition [H it is occasionally useful to consider the 
class of all M-automata where M is not finitely generated, since the corre- 
sponding class of languages (the union of the classes corresponding to the 
finitely generated submonoids of M) may not be defined by a single finitely 
generated monoid. See, for example, [91 Theorem 6.2], for an application of 
this approach. 

If M is a monoid generated by a set X, we say that the identity language 
Wx(M) of M with respect to X is the set of all words over X representing 
the identity. In the case M is a group, the identity language is traditionally 
called the word problem of the group; this terminology is justified by the fact 
that the membership problem of this language is algorithmically equivalent 
to the problem of deciding whether two given words represent the same 
element of the group, that is, to the word problem in the sense of universal 
algebra. In a general monoid there is no such equivalence, and so the term 
word problem is less appropriate. 

The following simple observation has been made by several authors (see, 
for example, [9]), but apparently overlooked by a number of others. It allows 
us to apply many standard results from the theory of formal languages to 
the study of M-automata. 

Proposition 2. Let L be a language and M a finitely generated monoid. 
Then the following are equivalent: 

(i) L is accepted by an M-automaton; 

(ii) L is a rational transduction of the identity language of M with respect 
to some finite generating set; 

(iii) L is a rational transduction of the identity language of M with respect 
to every finite generating set. 

Proof. We begin by proving the equivalence of (i) and (ii) . First suppose (i) 
holds, and let A be an M-automaton accepting L. Let S be the (necessarily 
finite) set of elements of M which occur on the left-hand-side of edge labels 
in A. Extend S to a finite generating set X for M. Now A can be viewed 
as a finite transducer from X* to S*. It follows easily from the relevant 
definitions that the image of Wx(M) is exactly the language L, so that (ii) 
holds. 
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Conversely, suppose (ii) holds. Then there is a finite generating set X 
for M and a finite transducer A from X* to S* such that L is the image of 
Wx(M) under A. We obtain from A an M-automaton, by replacing each 
edge label (w, x) with (m, x) where m is the element of M represented by 
w G X* . Again, it follows easily from the definitions that the resulting 
M-automaton accepts exactly the language L. 

Since the monoid is assumed to be finitely generated, it is immediate 
that (iii) implies (ii). It remains only to show that (ii) implies (iii). Let Y 
and X be finite generating sets for a monoid M. For each symbol a G X, 
choose a word w a G Y* representing the same element of M. Now let R 
be the submonoid of Y* x X* generated by the (finitely many) pairs of 
the form (w a ,a). R is a finitely generated submonoid, and hence also, by 
our observations above, a rational transduction. Now if L is the image of 
Wx(M) under a rational transduction S then the relational composition 

R o S = {(u, w) | (u, v) G R, (v, w) G S for some v G X*} 

is a rational transduction [J, Theorem II. 4. 4] , and it is easily verified that L 
is the image of Wy(M) under R o S. □ 

Proposition [2] tells us that the theory of M-automata can be viewed as a 
special case of the well-established field of rational transductions. Indeed, we 
can easily and profitably translate a large body of existing theory concerning 
rational transductions into the M-automaton setting. For two main reasons, 
however, the study of M-automata retains a distinct flavour, and remains of 
interest in its own right. Firstly, the structure of the register monoid can be 
used to prove interesting things about the accepting power of M-automata. 
Secondly, M-automata can be used to gain insight into computational and 
language-theoretic aspects of monoids. Both of these factors have special 
weight in the case that the register monoid is a group, with all the extra 
structure that entails. 

The following result, which has been observed independently by several 
authors [HE], is an immediate corollary of Proposition [2] together with 
the fact that rational transductions are closed under composition [1, Theo- 
rem III. 4. 4]. 

Corollary 3. Let M and N be finitely generated monoids. Then the identity 
language of N is accepted by an M-automaton if and only if every language 
accepted by an N -automaton is accepted by an M-automaton. 

We end this section with a discussion of closure properties of language 
families of the form F(M). First, we observe that every language class of 
the form F(M) is easily seen to be closed under finite union, as a simple 
consequence of non-determinism. On the other hand, a class F(M) need not 
be closed under intersection. The following theorem will allow us to provide 
a straightforward characterisation of when such a class is intersection-closed. 
The converse part was essentially proved by Mitrana and Stiebe [19] in the 
case that the register monoids are groups. Recall that a morphism between 
free monoids is called alphabetic if it maps each letter of the domain alphabet 
to either a single letter or the empty word; one language is said to be an 
alphabetic morphism of another if the former is the image of the latter under 
an alphabetic morphism of free monoids. 
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Theorem 4. Let Ml, M2, ■ ■ ■ M n be monoids. Then a language is accepted 
by an (Mi x • • • x M n )- automaton if and only if it is an alphabetic morphism 
of a language of the form L\ Pi • • • Pi L n where each Li is accepted by an 
M{- automaton. 

Proof. Suppose first that L C £* is accepted by an (Mi x • • • x M n )- 
automaton. It follows easily from Proposition [1] that L is accepted by an 
(M{ x • • • x M^)-automaton where each M[ is a finitely generated submonoid 
of Mj. For each M[ choose a finite generating set Xi and let X = X\\J - ■ -UX n 
so that X is a finite generating set for M{ x • • • x M' n . Now by Proposition [2 
L is the image of the identity language W = Wx{M[ x • • • x M' n ) under some 
rational transduction p C X* x £*. 
We claim that W can be written as 

w = k x n • • • n K n 

where each Ki £ X* lies in ¥{M-). Let Ki be the set of all words w £ X* such 
that w represents a tuple in M{ x • • • x M' n with the identity element 1 in the 
ith position. It is readily verified that Ki is accepted by an Mj'-automaton 
with a single (initial and terminal) state q, and a loop at q labelled (x, g) 
whenever the letter x £ X represents a tuple with g in the ith position. Now 
a word w represents the identity of M{ x • • • x M' n if and only if w lies in Ki 
for all i, so that W is the intersection of the Ki as required. 

Now by [1, Theorem III. 4.1], there exists an alphabet Z, a regular lan- 
guage R C Z* and two alphabetic morphisms a : Z* —* X* and (3 : Z* — > S* 
such that 

L = Wp = (Wa' 1 n R)/3 = {{Kx n • • • n K n ) a - 1 n R)/3. 

It is easy to check that inverse morphisms distribute over intersection, so 
that 

{Kx n • • • n K n )or x = Kxor 1 n • • • r\K n or x . 

Thus we obtain 

L = {{K 1 a~ 1 n---nK n a- 1 )nR)(] = {{K l a- 1 C\R)C\---C\{K n a- l C\R))l3. 

Each F(Mj') is closed under rational transductions, and hence also under 
inverse morphisms and intersection with regular languages. It follows that 

{Ka^HR) G ¥{M[) C F(M) 

for each i, so setting Li = {Kia^ 1 D R) completes the proof of the direct 
implication. 

For the converse, it is easy to check that the graph of an alphabetic 
morphism between free monoids is a ratonal transduction; it follows that 
the family F(Mi x • • • x M n ) is closed under alphabetic morphisms. Thus, 
it suffices to suppose L = L\ Pi • • • V\L n where each L{ is in F(Mj), and show 
that L € F(Mi x • • • x M n ). For this, it is clearly sufficient to prove the case 
n = 2 and then apply induction. Suppose, then, that L\,L2 Q X* and that 
each Li is accepted and by an Mj-automaton Ai with state set Qi. Recall 
that the edges of each A4 are labelled by elements of M[ x (S U {e}). We 
define an (Mi x M2 )-automaton A with: 

• state set Q\ x Q 2 ; 
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• an edge from (s,u) to (t, v) labelled ((x,y),a) whenever A\ has an 
edge from s to i labelled (x, a) and A2 has an edge from u to v 
labelled (y, a) for some letter a € E; 

• an edge from (s,u) to (i, u) labelled ((x, l),e) whenever Ai has an 
edge from s to t labelled (2, e); 

• an edge from (s,u) to (s,v) labelled ((l,y),e) whenever A2 has an 
edge from u to v labelled (y, e); 

• start state (po> Qo) where po and go are the start states of A\ and ^2 
respectively; and 

• final states (p, q) such that p and g are final states of A± and A2 
respectively. 

It is an easy exercise to verify that A accepts exactly the intersection L = 
Li n L 2 . □ 

Returning to our example, an elementary application of Theorem 0] is 
the following characterisation of the classes of languages accepted by blind 
n-counter automaton. 

Corollary 5. A language is recognised by a blind n-counter automaton if 
and only if it is an alphabetic morphism of the intersection of n languages 
recognised by blind 1- counter automata. 

We also obtain a characterisation of those monoids M for which F(M) is 
closed under intersection. 

Corollary 6. Let M be a finitely generated monoid. Then F(M) is closed 
under finite intersection if and only if there exists an M -automaton accepting 
the identity language of M X M. 

Proof. If there exists an M-automaton accepting the identity language of 
M x M then by Corollary EJ every language recognised by an (M x M)- 
automaton is recognised by an M-automaton. But by Theorem HI this 
means that every intersection of two M-automaton languages is recognised 
by an M-automaton. It follows by induction that F(M) is closed under finite 
intersection. 

Conversely, the identity language of M x M is certainly recognised by an 
(M x M)-automaton, and hence by Theorem[2]is an alphabetic morphism of 
the intersection of two languages in F(M). Thus, if the latter is intersection- 
closed then it contains the identity language of M x M, as required. □ 

Corollary has a particularly interesting interpretation in the case that 
the monoid M is a free group. We shall see in Section[5]below that a language 
is context-free if and only if it is recognised by a free group automaton. A 
well-known theorem of Muller and Schupp [20j , combined with a subsequent 
result of Dunwoody [5] , tells us that a finitely generated group has context- 
free word problem if and only if it is virtually free, that is, has a free subgroup 
of finite index. It follows that the fact that context-free languages are not 
intersection closed can be viewed as a manifestation of the fact that a direct 
product of virtually free groups is not, in general, virtually free. 
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4. Free Groups and Context-free Languages 

In this section we consider M-automata where M is drawn from two 
particularly significant classes of monoids; namely, polycyclic monoids and 
free groups. From the perspective of algebra, these may be considered the 
motivating examples for the subject. We observe that an important theorem 
of Chomsky and Schiitzenberger [2] has a natural interpretation in terms of 
M-automata, and show how M-automata techniques can be used to provide 
an algebraic and automata-theoretic proof of the theorem. 

We begin by recalling a basic definition from automata theory. Let X 
be a finite alphabet. A pushdown store or stack with alphabet X is a stor- 
age device which stores, at any one time, a finite but unbounded sequence 
of symbols from X, The basic operations permitted are appending a new 
symbol to the right-hand end of the sequence ("pushing" a symbol), remov- 
ing the rightmost symbol from the sequence ("popping") and reading the 
rightmost symbol on the stack. 

The possible configurations of a pushdown store are naturally modelled by 
elements of the free monoid X*. The operations of pushing and popping can 
be modelled by partial functions, defined upon subsets of X*. Specifically, 
for each symbol x S X we define a function 

P x : X* -» X* w^wx 

which models the operation pushing the symbol x onto the stack. The 
corresponding operation of popping x can only be performed when the stack 
is in certain configurations - namely, when it has an x as the rightmost 
symbol - and so is modelled by a partial function. 

Q x : X*x — * X* wx I— > w. 

The set of functions 

{P X ,Q X \x£X} 

generates a submonoid of the monoid of all partial functions on X*, under 
the natural operation of composition. This monoid, which was first explic- 
itly studied by Nivat and Perrot |21j . is called the polycyclic monoid on 
X and denoted P{X). The elements of P{X) encapsulate the various se- 
quences of operations which can be performed upon a pushdown store with 
alphabet X. The rank of P(X) is defined to be the size \X\ of the alphabet 
X; a polycyclic monoid is uniquely determined (up to isomorphism) by its 
rank. Polycyclic monoids also arise naturally in the structural theory of 
semigroups; of particular importance is that of rank 1, which is known as 
the bicyclic monoid. For more general information see |16l Section 9.3]. 

There is a natural embedding of the free monoid X* into the polycyclic 
monoid P(X), which takes each symbol x to P x . With this in mind, we shall 
identify P x with x itself. The element Q x is an inverse to P x in the sense 
of inverse semigroup theory |16j ; hence, we shall denote it x -1 . Thus, the 
monoid P{X) is simply generated by the set 

X = {x,x- 1 | x G X}. 

and we shall have no further need of the notation P x and Q x . 

The inversion operation extends to the whole of P(X), by defining (x -1 ) -1 = 
x for each x € X, and (x\ . . . x n ) _1 = x~ x . . . x^ 1 for x±, . . . , x n E X. If x 
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and y are distinct elements of X then the product xy~ is the empty partial 
function and forms a zero in the semigroup P{X). 

We shall define a pushdown automaton to be a polycyclic monoid automa- 
ton. The equivalence of this definition to the standard one is straightforward 
and well-documented, for example in [9]. For those familiar with the usual 
definition, we remark that the top symbol on the stack corresponds to the 
rightmost position of the word, and that the automaton accepts with empty 
stack. 

The attentive reader may have noticed that our polycyclic monoid model 
of the pushdown store does not provide explicitly for "reading" the con- 
tents of the stack. However, a polycyclic monoid automaton can use non- 
determinism to test the rightmost stack symbol, by attempting to pop every 
possible symbol and moving to different states depending upon which suc- 
ceeds; all but one attempt will result in the register containing a zero value, 
which effectively constitutes failure. This is an example of a more general 
phenomenon, in which the apparent blindness of an M-automaton can be 
overcome by the use of non-determinism. 

The languages accepted by pushdown automata are called context-free. 
The class of context-free languages, which also admits an equivalent defi- 
nition in terms of generating grammars [I], is one of the most important 
languages classes in computer science. 

We recall also a key notion from combinatorial group theory. Recall 
that the free group on the alphabet X is the group defined by the monoid 
presentation 

(X | xx~ l = x _1 x = 1 for all x G X). 

Generators from X we shall call positive generators, while those from X \ X 
are negative generators. Free groups are of central importance in combinato- 
rial and geometric group theory; see [T7] or [TH] for a detailed introduction. 

The identity languages of the free group F(X) and the polycyclic monoid 
P(X), with respect to the standard generating set X, are well-known in 
formal language theory. They are called the 2-sided Dyck language (or just 
Dyck language) on X and the 1- sided Dyck language (or restricted Dyck 
language or semi-Dyck language) on X respectively. The former consists 
of all words over X which can be reduced to the empty word by successive 
deletion of factors of the form xx^ 1 or x~ 1 x where x G X. The latter 
contains all words which can be reduced to the empty word by deleting only 
factors of the form xx~ l with x £ X. In particular, we see that the latter is 
(strictly) contained in the former. Thus, any word over X representing the 
identity in P(X) also represents the identity in F(X); the converse does not 
hold. 

A well-known theorem of Chomsky and Schiitzenberger states that the 
context-free languages are exactly the rational transductions of 1-sided Dyck 
languages, and of 2-sided Dyck languages [2, Proposition 2]. By Proposi- 
tion [2] this result has the following interpretation in the M-automaton set- 
ting. 

Theorem 7 (Chomsky-Schiitzenberger 1963). Let L be a language. Then 
the following are equivalent: 
(i) L is context-free; 
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(ii) L is accepted by a poly cyclic monoid [of rank 2/ automaton; 

(iii) L is accepted by a free group [of rank 2 / automaton. 

We have already remarked that the equivalence of (i) and (ii) is the usual 
equivalence of context-free grammars and pushdown automata, and so is 
well-known. It is also straightforward to show that a pushdown automaton 
can simulate a free group automaton, so that (iii) implies (ii); this is left as 
an exercise for the interested reader. The restriction to polycyclic monoids 
[respectively, free groups] of rank 2 is a simple consequence of the well-known 
fact that every polycyclic monoid [free group] of countable rank embeds into 
the polycyclic monoid [free group] of rank 2. What remains, which is the 
real burden of the proof, is to show that (i) and/or (ii) implies (iii). 

The original proof of Chomsky and Schiitzenberger starts with a context- 
free grammar, and produces from it an appropriate rational transduction; 
an example of this approach can be found in pQ. A direct group-theoretic 
proof of this result was claimed by Dassow and Mitrana [I]; however, their 
construction was fundamentally flawed [3]. A correct algebraic proof has 
recently been provided by Corson [3] , who exhibited a free group automaton 
accepting the identity language of a polycyclic monoid automaton. The 
authors of both [4J and [3] appear to have overlooked the equivalence of the 
statement to the theorem of Chomsky and Schiitzenberger. 

Theorem [7] is quite surprising, in view of our comments above regarding 
the method used by a polycyclic monoid automaton to read the rightmost 
symbol of the stack. A polycyclic monoid automaton apparently makes 
fundamental use of its ability to "fail", by reaching a zero configuration of 
the register monoid. Since a free group has no zero, a free group automaton 
seems to have no such capability, and appears to be "blind" in a much 
more fundamental way. However, it transpires that a carefully constructed 
interplay between the finite state control and the group register can achieve 
the desired "failing" effect. 

We remark also upon an interesting corollary to Theorem [JJ It is well 
known that every recursively enumerable language is a homomorphic image 
(and hence a rational transduction) of the intersection of two context-free 
languages. Hence, combining Theorems 0] and [JJ we immediately obtain the 
following result, which was first observed in the group case by Mitrana and 
Stiebe US]. 

Theorem 8. Let M be a free group of rank 2 or more, or a polycyclic 
monoid of rank 2 or more. Then F(M x M) is the class of all recursively 
enumerable languages. 

In the rest of this section, we present an alternative group- and automata- 
theoretic proof of Theorem [7J In particular, we show explicitly how a free 
group automaton can simulate the operation of a pushdown automaton. 
In the process, we also obtain some technical results relating polycyclic 
monoids to free groups, which may be of independent interest. We begin 
by introducing a construction of a free group automaton from a polycyclic 
monoid automaton, that is, a pushdown automaton. 

Suppose L C S* is the language accepted by a P(A)-automaton A with 
state set Q, that is, by a pushdown automaton with state set Q and stack 
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(x, a) 






) 



Figure 1. A pushdown automaton (left) and a free group 
or pushdown automaton (right), both accepting the 1-sided 
Dyck language on a two-letter alphabet. 

alphabet X. Let # be a new symbol not in X, and let X# denote the 
alphabet X U {#}. We construct from A an new finite automaton A' with 
edges labelled by elements of (X# xS*). It has: 
• state set Q' = Q- U Q + where 



are disjoint sets in bijective correspondence with Q; 

• start state q + where q is the start state of A; 

• final states of the form where q is a final state of A; 

• an edge from p + to g+ labelled (x#, w) whenever A has an edge from 
p to q labelled (x, w) with x a positive generator. 

• an edge from p_ to q + labelled (x'#,w) whenever A has an edge 
from p to q labelled by (x',w) with x' a negative generator; 

• an edge from p + to q + labelled (e, w) whenever A has an edge from 
p to q labelled (e,w); 

• for each q G Q, an edge from q + to q_ labelled (e, e); and 

• for each q G Q, a loop at state q- labelled (# , e). 

The automaton A' can be interpreted either as a P(X*)-automaton or as 
an -F(X^)-automaton; it transpires that the language accepted is the same 
for each choice. Figure 1 illustrates a P({x})-automaton accepting the 1- 
sided Dyck language on the alphabet {a, b}, together with the automaton 
constructed from it by the procedure above. 

In general, we make the following claim. 

Theorem 9. The free group automaton A' and the polycyclic monoid au- 
tomaton A' both accept exactly the language L. 

The rest of this section is devoted to the proof of Theorem [9l We shall 
need a number of preliminary definitions and results. 

Definition 10. Let w\, . . . , w n E X and w = w± . . . w n € X* . Let f be a 
new symbol not in X. A permissible padding of w is a word of the form 



Q+ = {<?+ I Q ^ Q} an d Q 



{q-\q£ Q} 



xix 2 ...x n {# v ) k 
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where k £ No and for each i € {1 ... n} we have 

I (# _1 ) m ^i# f° r some m £ N° if iOj is a negative generator; 
Xi = < 

I if is a positive generator. 

Thus, a permissible padding of w is obtained by inserting the symbol 
# after every generator in w, and zero or more # _1 s before each negative 
generator, and at the end of the word. 

The following lemma connects the above definition to our free group au- 
tomaton construction; it can be routinely verified. 

Lemma 11. The automaton A' accepts (x,w) if and only if there exists a 
word y G X* such that x is a permissible padding of y, and (y, w) is accepted 
by A. 

We shall also use the following straightforward lemma concerning words 
representing the identity in the free group. 

Lemma 12. Let w € X* be a word representing the identity in a free group 
F(X), and suppose w = uxv where u,v are words and x G X. Then either 
u has a suffix x~ 1 e where e represents the identity, or v has a prefix ex' 1 
where e represents the identity. 

Proof. We have seen that any word in the 2-sided Dyck language, that is, 
any word representing the identity in the free group, can be reduced to the 
empty word by successively removing factors of the form xx' 1 and x~ l x 
where x £ X. Such a reduction process for w must eventually bring the 
given occurrence of the generator x next to some occurrence by delet- 
ing the letters between them. But the product of these letters must be a 
factor representing the identity; setting e equal to this factor will give an 
appropriate factorisation of either u or v (depending upon whether the given 
occurrence of x^ 1 occurs before or after that of x). □ 

Recall that an element of the free group F(X) is called positive if it can 
be written as a product of one or more positive generators. The following 
definition facilitates a geometric interpretation of the positive elements. 

Definition 13. Let w € X and let x <G F(X). We say that £ is a minimum 
of w, if 

(i) w has a prefix representing x; and 

(ii) no prefix of w which represents x is immediately followed by a neg- 
ative generator. 

Proposition 14. Let w be a word representing the identity in F(X). Then 
the following are equivalent: 

(i) w represents the identity in the polycyclic monoid P(X); 

(ii) every prefix of w represents a positive or identity element; 

(iii) the only minimum of w is the identity of F(X). 

Proof. The equivalence of (i) and (ii) is well-known, and easily deduced from 
the definitions. 

Suppose now that (ii) holds, that is, that every prefix of w represents a 
positive or identity element. It is easily seen that the identity is a minimum 
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of w. Moreover, if x is a non-identity element represented by a prefix of w 
then consider the longest prefix of w representing x. Considering the path 
traced through the Cayley graph of F(X), and recalling that the latter is 
a tree, it is clear that the letter following this prefix must be a negative 
generator. Thus, x cannot be a minimum of w, and so (iii) is satisfied. 

Conversely, suppose that (ii) does not hold, that is, that some prefix of 
w represents a non-positive, non-identity element. Suppose further for a 
contradiction that (iii) holds, that is, that the identity is the only minimum 
of w. Let e be a non-positive, non-identity element represented by a prefix 
of w. Since e is not a minimum for w, there is a prefix u of w representing e, 
which is followed by a negative generator x^ 1 . But now ux^ 1 represents an- 
other non-positive, non-identity element. Continuing in this way, we obtain 
an infinite sequence of prefixes of u, which must clearly all represent distinct 
elements. Since w is a finite word, this gives the required contradiction. □ 

Lemma 15. Let w 6 X# be a word representing the identity in P(X^) 
and suppose w = uv. Then there exists a factorisation v = st such that 

(i) either t = e or t begins with a negative generator; and 

(ii) u#s#~H also represents the identity in P(X#). 

Proof. If u represents the identity then v also represents the identity, so it 
suffices to take s = v and t = e. 

Assume now that u does not represent the identity, and consider the path 
traced through the Cayley graph of the free group F(X^) when starting 
from the identity and reading w. Since w represents the identity in P(X#), 
it also represents the identity in F(X^), so having reached the element 
represented by u, this path must return to the identity. Since the Cayley 
graph is a tree, the path must either leave in the direction of the identity, in 
which case we take s = e, or leave away from the identity and then return 
to the element represented by u having read a word s, before leaving in the 
direction of the identity. By Proposition 1141 u represents a positive element, 
so "in the direction of the identity" means following a negative generator. 
Defining t to be such that v = st, it is now clear that s and the corresponding 
t have the desired properties. □ 

The following proposition, which may also be of interest in its own right, 
is the main step in the proof. 

Proposition 16. Let w £ X* . Then the following are equivalent. 

(i) w represents 1 in the poly cyclic monoid P{X); 

(ii) w admits a permissible padding which represents 1 in the polycyclic 
monoid P(X&); 

(iii) w admits a permissible padding which represents 1 in the free group 
F(X#). 

Proof. First suppose (i) holds. By repeated application of Lemma [T5l we 
can insert the symbol # between every pair of generators and the symbol 
in appropriate places, so as to obtain a permissible padding of w which 
represents 1 in P(X&). Thus, (ii) holds. 

Clearly every word representing the identity in P(X#) also represents the 
identity in F(X&), so that (ii) implies (iii). 
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Finally, suppose (iii) holds, and let w' be a permissible padding of w 
which represents 1 in F(X&). Suppose for a contradiction that w does not 
represent 1 in P(X). Certainly since w' represents 1 in F{X*) we must have 
that w represents 1 in F(X). So by Proposition 1141 w contains a minimum 
which is not the identity. Let u be the shortest prefix of w representing this 
minimum, and write w = uv. 

It follows that we can write w' = u'#v' where u' and v' are paddings of 
u and v respectively. Certainly, since u is the shortest prefix representing 
the given element, u has no suffix representing the identity in F(X). It 
follows that v! has also no suffix representing the identity in F(X). Hence, 
by Lemma [T2| we can write v' = e'# q 1 where e' represents the identity. 

Let q and e be the words over X obtained by deleting all occurences of 
the letters # and from q' and e' respectively. Since w' = u'j^e'j^^q 1 is 
a permissible padding of w, it follows that q = e or q begins with a negative 
letter. But we have w = uv = ueq where e represents the identity. If q = e 
then u must represent the identity, which is a contradiction. On the other 
hand, if q begins with a negative letter, then this contradicts the assumption 
that u is a minimum of w. □ 

We are now ready to complete the proof of Theorem [9l 

Proof. Suppose a word w is accepted by the pushdown automaton A. Then 
by definition, there exists a word x € X such that x represents the identity 
in P(X), and (x,w) is accepted by A when viewed as a usual finite au- 
tomaton. Now by Proposition 1161 x admits a permissible padding y which 
represents 1 in the polycyclic monoid P(X^), and hence also in the free 
group F(X&). Now by Lemma fTTT (y,w) is accepted by A' as a finite au- 
tomaton over F(X*) x £* and over P{X#) x S*. Hence, w is accepted by 
A' as both a free group automaton and a polycyclic monoid automaton. 

Conversely, if w is accepted by A' as a free group automaton [polycyclic 
monoid automaton], then by definition there exists a word y such that 
(y, w) is accepted by A' as an automaton over F(X#) x £* [respectively, 
P(X&) x £*] and y represents 1 in the free group [polycyclic monoid]. Now 
by Lemma HU y is a permissible padding of some word x, such that (x, w) 
is accepted by A viewed as a finite automaton over P(X) x E*. But by 
Proposition [TBI x represents 1 in the polycyclic monoid P{X), so that w is 
accepted by A, as required. □ 
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